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Abstract. This paper presents some finite combinatorics of set systems 
witli appiications to modei tlieory, particulariy tlie study of dependent 
theories. There are two main results. First, we give a way of producing 
lower bounds on VCind-density, and use it to compute the exact VCind- 
density of polynomial inequalities, and a variety of geometric set families. 
The main technical tool used is the notion of a maximum set system, 
which we juxtapose to indiscernibles. In the second part of the paper 
we give a maximum set system analogue to Shelah's characterization of 
stability using indiscernible sequences. 



1. Introduction 

In the recent past there have been a number of papers relating various 
measures of the combinatorial structure of NIP theories to one another [H O 
[7] . One fact which emerged from this is the close relation of dp-rank and VC- 
density restricted to indiscernible sequences. Guingona and Hill have used 
the term VCind-density to describe VC-density restricted to indiscernibles. 
At the end of their paper Guingona and Hill ask if there is a useful 
characterization of when a formula has VCind-density equal to VC-density. 
We offer such a characterization below (Corollary [33]), and use it to compute 
the exact VCind-density of certain formulas. 

A separate goal of this paper is to show how maximum set systems can in 
many cases be used as more accessible surrogates for indiscernible sequences. 
To this end we translate Shelah's well-known characterization of stability in 
terms of indiscernible sets to a version involving maximum set systems. 



2. Notation 

Let there be a fixed complete theory T, with a large saturated model 
Tl = (M, . . .). All sets and models, unless otherwise stated, are assumed to 
be elementarily embedded in the model 9Jt. We write formulas in partitioned 
form ¥5(x,y), where x = (xi, . . .,xi), and y = {yi, . . .,yk). 

For A C Mlyl and b G mI^'I, 
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We use S^p{A) for A C m'^' to denote the set of 93-types over A, where a 
(^-type over j4 is a maximal consistent set of the form {ib93(x, a) : a G M'^'}. 
We let S^{A)\b = {tp^{h/A) : b € 5} when B C mI'^L We sometimes 
identify S^p{A)\b and {C C A : C = (/^(b,^), for some b G 5}, as in the 
following definition. For a set X, let denote the power set of X. 

Definition 2.1. If A C mI^' and C C V{A), we will say that ip{x, y) traces 
C if for some 5 C il^^l, S^{A)\b = C. 

We now give some purely combinatorial definitions. For the rest of this 
section suppose X is a set and C C V[X). For A Q X, let = {C fl A : 
C £ C}. Say that C shatters A if C\^ = ViA). 

Definition 2.2. The Vapnik-Chervonenkis (VC) dimension of C, denoted 
VC{C), is \A\ where A Q X is of maximum cardinality such that C shatters 
A. 

If the VC-dimension of C does not exist, we write VC(C) = 00. The 
VC dimension was first considered in [16] and was introduced into model 
theory by Laskowski [llj. The following notion of a maximum VC family 
was defined by Welzl [T7]. 

Definition 2.3. Say that C is d-maximum for d £ to if both of the following 
hold. 



(1) VC{C) = d 

(2) For any ACX, finite, , - y^^j 



Here (.T^) is shorthand for ^ 



i=0 



Lemma 2.1 (Sauer's Lemma [13\ fUj). If C has VC{C) = d, then for any 
finite A C X , 



\C\^\ < 



< d 



Thus a d-maximum set system is "extremal" among set systems of VC- 
dimension d. These set systems are highly structured, and well-understood 
[H [To] . There are several examples that arise from natural algebraic situa- 
tions. In fact it is conjectured that all d-maximum set systems arise from 
(or embed naturally in) arrangements of half-spaces, either in a euclidean 
or hyperbolic space [T2] . 

It is easy to see that if C is d-maximum on X, and A Q X has \A\ = d + 1, 
then 

= \ {C} 

for some CCA. Floyd [4| calls such a C the forbidden label of C on ^. 
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Let [X]"" := {A C X : \A\ = m}. For a fixed d-maximum C C P(X), 
associate each A € [X]'^^"'^ witli tlie forbidden label Ca C A, where C\^ = 

nA)\{CA}. 

Floyd proves the following. 

Proposition 2.2. On a finite domain X , any d-maximum set system C is 
characterized by its forbidden labels, in the sense that \/B C X, 

B eC ^ yAe [X]'^+\B nA)^CA 

Proof. Left to right is obvious. For right to left, let B satisfy the given 
conditions. Then C U {B} shatters no sets not shattered by C. By Sauer's 
Lemma, B must already be in C. □ 

We now define the notion of a forbidden code, which is essentially the 
"form" of a forbidden label, when an ordering is present Q 

Let £c(^) = {Ca G [X]^^+i : A G [Xf+^} denote the forbidden labels 
of C on X. Let < be a fixed but arbitrary linear order on X. For each 
Ca € ^c{X), let Ca = {to, ■ ■ ■ ,td), where each ti € {0, 1}, and tj = 1 if and 
only if the ith elem ent of A is in Ca- Define £c(^) = (Cl : Ca G £c(^)}- 

We will refer to Zc{X) as the set of forbidden codes on X for C, with 
respect to <. 

There is a natural way in which forbidden codes can serve as combinatorial 
invariants for finite unions of points and <-convex subsets in X. To see this, 
suppose {X, <) is a dense and complete linear order, and B C X is a finite 
union of convex subsets. Let d be the number of boundary points of B. We 
can imagine that B is defined by some L< = {<} formula ^{x,ci, . . . ,Cd) 
with ci < ■ ■ ■ < Cd £ X. Let T{B) = {tp{X, c[, ■ ■ ■ , c'^) : c[ < ■ ■ ■ < c'^ e X}. 
Intuitively the elements of J~{B) C V{X) are the "homeomorphic images" 
of B in (X, <). In [8] we show that J~{B) is finitely characterized by some 
r] G 2'^'^^. Recall that being finitely characterized by rj means that for any 
finite Xq C X, and any A C Xq, there is C G J^iB) such that C n Xq = A 
if and only if there are not ao < • • • < in Xq such that a, G A if and only 
if r]{i) = 1. 

Define the genus of B, denoted G{B), to be the rj G 2'^"'"^ that finitely 
characterizes J'{B). Equivalently, define G{B) to be any rj G 2^^"*"^ such that 
there are no < • • • < in X such that G -B if and only if r/(i) = 1, for 
all i = 0, . . . , d. 

We will take it to be established (see iSj) that such an rj exists, and is 
unique, as well as the further fact that for any rj G 2^^^ there is some i? C X 
such that G{B) = rj. Simple rules for computing genus are given in Tabled! 

To give an example of applying the table, suppose X = M, and < is the 
usual ordering. Then the genus the point {0} is (11), and the genus of the 
interval (0, 1) is (101). Conversely, to consider all subsets of M with genus 



In we did not distinguish between forbidden codes and labels, but referred to both 
using the same term. 
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code 


translation 


(1...) 




do nothing 


(0...) 




(-00,... 


(...0,0. 


■ ■> 


remove point 


(...0,1. 


■•) 


end interval 


(...1,0. 


••) 


begin interval 


(...1,1. 


• ■) 


add point 


(...0) 




...,oo) 


(...1) 




do nothing 



Table 1. A key for assigning forbidden codes to unions of 
convex sets. 

(11), let C be all the singletons. Similarly, the collection of all subsets of 
genus (101) is exactly the set of all infinite convex subsets which are not 
coinitial or cofinal. 

The assumption that {X, <) is complete was made to give a clear pre- 
sentation of the genus concept, and is sufficient for this paper. One can, 
however, define the genus of C X on other orders by considering the 
shortest 77 G 2^'^ which B does not induce, sidestepping the issue of bound- 
ary points. 

The link between genus and forbidden codes is given in the following 
theorem. 

Theorem 2.3. 

Suppose that (X, <) is a complete dense linear order without endpoints. If 
r] G 2*^+^, and C = {C C X : G(C) = 77}, then C is d-maximum on X and 
MX) = {r,}. 

Proof. We provide a sketch of the proof that C is d-maximum, which is very 
similar to the well-known proof that Tinions of intervals are maximum. 

Let XqCX he finite, and C = {C C X : G(C) = ry}. Let a := max{Xo} 
and Xg = Xo\ a. Define C« = {C G CfS : C U {a} G Cf° & C G C\^°}. 
Show, by induction on |Xo| and d, that is d — 1 maximum and C\'^<^ is d- 
maximum. Then show |C|^o| = |C°| + \C\^ol and that, by Pascal's identity, 
|C|Xo| = (|Xol). 

□ 

3. Results 

3.1. VCm and VCjnd-density. We now apply the above to achieve our 
results. Recall the following definitions. 

Definition 3.1. A formula ip{x, y) has VC-dcnsity < r for r € if there 
is K E uj such that for every finite A C ^^^l, \S^{A)\ < • We denote 
this by VCd{(p) < r. 
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Definition 3.2. A formula (p{x, y) has VCind-density < r for r G M if there 
is K ^ 00 such that for every finite and indiscernible A C mI^', < 
K ■ \A\^' . We denote this by VCdindi'^) ^ 

The study of VC-density has emerged several times in model theory. See 
[1] for historical remarks. 

There has been some study of the fact that frequently VCd(v9) is bounded 
by a simple (and uniform) function of |x| [HE]. When this is true, it justi- 
fies the heuristic practice of "parameter counting" to guess the complexity 
of set systems. Guingona and Hill showed that in a dp-minimal theory 
VCdind(v') ^ Thus there is interest in bounding YCd{ip) by a function 
of VCdind(v') (obviously VCd((/?) > VCdind('/')-) This may not be possible 
in general, but we now show a practicable route to achieving it for a given 
formula. 

Definition 3.3. 

Tr{ip,]\Sy\) := {C:3AQ A^2'I,C C V{A),^p traces C} 

The following is easy, but interesting, because the collection of d-maximum 
set systems would seem, a priori, to be very diverse. It also informs Defini- 
tion [331 

Lemma 3.1. For each d £ oj, there is some C € Tr((/9, Afl^') such that 
\C\ > ^^o and C is d-maximum if and only if for each n £ uj, n > d, there is 
Cn G Tr{ip, ikfl^l) such that \ |JC„| = n and Cn is d-maximum. 

Proof. For the right to left direction, it is easily seen that the property of 
being d-maximum is elementary. Apply the compactness theorem and the 
saturation of 9Jl. For left to right, note that if C is d-maximum and X' (1 X 
with \X'\ = n, n > d, then C\^ is d-maximum. □ 

Definition 3.4. 

VCm{ip) := max{d € w : 3C G Tr{ip, m'^'), \C\ > ^^0)C is d-maximum} 

If (aj)jg7 is a sequence of indiscernibles and (/, <) is a complete and dense 
linear order without endpoints, for B C (aj)^^/, define G{B) to be the genus 
of {i G / : a, G B} C I. With these assumptions, for m £ oj, define 
Pm{B) = {p G 2- : 3io < • • • < im-i G I : £ B ^ p{j) = 1}. 

If /i G 2^^, r/ G 2' and / < fc, write r] < fi li rj \s a. subsequence of /i, 
meaning that for some order preserving embedding f : I k (where k and 
/ are regarded as ordinals) Vi G l,fj,{f{i)) = rj{i). 

Lemma 3.2. // {ai)i^j is a sequence of indiscernibles and (I, <) is a com- 
plete and dense linear order without endpoints and B C {ai)i^j, is such that 
G{B) = r] for some rj G 2'^~^^ then 

(1) \P^{B)\ = {Z) 

(2) VpG2'»,pGP„(5) ^ rj^p. 
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Proof. The first claim is a corollary of Theorem 12.31 To see why, consider B, 
together with some A C (aj)ig/ such that 1^41 = m. Theorem 12.31 describes 
what dichotomies on A arise as the endpoints of B are moved in an order- 
preserving fashion. But, dually, one can imagine that the endpoints of B are 
fixed, and that the points in A move. Claim (1) follows from the theorem, 
seen from the latter point of view. The second claim follows from the same 
theorem, together with an application of Proposition 12. 2[ □ 

Assume that the formula f{x,y) is NIP in the following lemma. 

Lemma 3.3 (Transfer Lemma). // {cLi)i^i is a sequence of indiscernibles 
and {I, <) is a complete and dense linear order without endpoints and B C 
{oiji^i is defined by {p{{ai)i^i, c) and A C {ai)i^j is finite and A' C A can 
be traced as A' = A D B' where B' C {ai)iQj is such that G{B') = G{B), 
then for some c' £ ikfl^l, ip{A, c') = A'. 

Proof. Let m = \A\, and suppose ajg < • • • < aj^_-^ is an enumeration of A. 
Since G{B') = G{B), we have Pm{B) = Pm{B') by LemmaOpart (2). Let 
P £ Pm{B') be such that for each j = 0, . . . , m — 1, a^^. G A' -4=^ p{i) = 1. 
Then p G Pm{B), and for some a^^ < • • • < in {ai)iei, 

m—l 
3=0 

and thus 

m— 1 
j=0 

Then, by indiscernibility, 

m— 1 

j=o 

and the witnessing c' is the desired parameter. □ 

In the proof of the following theorem, for p € S^{A), we identify p and 
B = {a £ A : (/^(x, a) € p} without further comment. 

Theorem 3.4. For any ip{x,y), VCm{ip) = VCdind{y^)- 

Proof. First note that we always have YCm{ip) > and VCdind('/') > 0. 

Now suppose VCdind(¥') ^ d, for some positive d £ u. Let < e < 1/2. 
By compactness, Ramsey's theorem, and saturation of the monster model, 
there is some indiscernible sequence (aj)jg]K such that [^^(yl)! > |^|'^~'^ for 
arbitrarily large finite A C (ai)jgiR. 

Claim: 3B G 5'^((aj)igK) with lg{G{B)) =d+l. 
First we argue that there is B £ 5(^((aj)jg]R) with lg{G{B)) > d+1. Suppose, 
to the contrary, that VB £ S^{{ai)i^M.), we have lg{G{B)) < d. Then a 
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counting argument gives K ^ to such that for every finite A C (aj)jgR, 
|S'^(yl)| < K ■ 1^1'^"-'^, violating the hypothesis on (aj)jg]R. 

Second, note that if there is some B G 5<^((ai)ieK), with lg{G{B)) > d+1, 
then by compactness and saturation there is some B' G ^(^((aj)^^]^), with 
lg{G{B')) = d + 1. This proves the claim. 

Now take B as in the claim. It follows from the Transfer Lemma that on 
any finite A C (ai)jeK, 

{B'nA-.B'C (a,)ieM,G(S') =G(S)} e Tri^M""^) 

This implies, by Theorem [231 that (p traces arbitrarily large d- maximum set 
systems, and, by Lemma l3.ll Tr{ip,M^^^) contains an infinite d-maximum 
set system. Thus VCm((^) > d. 

To show the other direction, suppose VCm((^) > d. By compactness, 
saturation, and Ramsey's theorem (or, alternatively, by Erdos-Rado) there 
is an infinite indiscernible sequence A = (aj)^^^^ on which (p traces a d- 
maximum set system. It follows from the definition of d-maximum that 
Sip{A) witnesses that VCdind(v) ^ d. □ 

It should be noted that Guingona and Hill prove that VCdind((/?) is equal 
to several other invariants, among which VCm((/j) may obviously be in- 
cluded. 

An immediate corollary is the following. 

Corollary 3.5. For any formula Lp{x,y), VCd{ip) = VCdindi'p) if and only 
if ip traces an infinite d-maximum set system, where d = VCd{ip). 

This condition is easier to use in practice than a direct appeal to a noncon- 
structive combinatorial principle such as the Ramsey or Erdos-Rado theo- 
rem. We give an algebraic example in the theory of real closed ordered fields 
(RCOF). Though we make efforts to be precise in the following statements, 
we are just considering a definable family that results from a polynomial 
inequality where the coefficients form the parameter set. 

Without loss we work in M. Let y be a finite sequence of variables. For a 
given m G cj, let denote the set of monomials which occur in the general 
polynomial of degree m with variables y. Let S C Ym be chosen, and put 
d = \S\. Let M[y]5 denote the polynomials with real coefficients all of whose 
constituent monomials occur in S. Consider C = {pos(p) : p G M[y]s}, 
where pos(p) = {a G M'^' : p(a) > 0}. 

Such a C can clearly be traced by some (^(x, y) G Lring, in M ^ RCOF. It 
is known (see Floyd [4], section 8.1) that for such a 99, we have VCm(99) > d. 
As it is well-known that VCd((/?) = d, we have VCdind(</') = VCd((/?) for 
polynomial inequalities if. 

Floyd's result is based on an application of Dudley's theorem ([3], Theo- 
rem 4.2.1), which can apply to somewhat more general situations (see Ben- 
David and Litman [2]). 

Many familiar geometric families, such as circles, ellipses, half-spaces, 
hyperbolas, etc., therefore have VCjnd-density equal to VC-density. The 
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above approach notably does not apply to geometric families which are not 
polynomial definable (in the above sense) such as axis-parallel rectangles, or 
convex d-gons, though such results may follow from other means. 

3.2. Stability. Here we show how to characterize the stability of (p using 
the maximum systems in Tr((/9, M'^'). For a review of the relevant notions 
from stability theory see [6]. 

Recall that the ladder dimension of a formula (/^(x, y) is defined by writing 
LD((/9) > n if and only if there are ag, . . . , a„_i in M'''! and bo, . . . , b„_i in 
M'^' such that 93t |= (^(aj,bj) <;=^ i < j. Finite ladder dimension is 
equivalent to stability for formulas. The VC-dimension can be thought of 
as a generalization of ladder dimension, and in general coVC{ip) < LD 
where coVC((/3) denotes the VC-dimension of 5^(mI^I) conceived as a set 
family. 

For a set X and C C define a graph Gc = {V,E) where V = C 

and E{Ci,C2) holds if and only if jCiACal = 1. For Ci,C2 G C, define 
disth{Ci,C2) to be the Hamming distance IC1AC2I, and distg^ (Ci, C2) to 
be the graph distance in Qc, with distg^ (Ci, C2) = 00 if Ci and C2 belong 
to different components. 

The following was proved by Warmuth and Kuzmin [lOj, (Lemma 14). 

Lemma 3.6. Let X be a finite set. Suppose C C V{X) is d-maximum, and 
Ci,C2 e C. Then 

disth{Ci,C2) = distg^{Ci,C2) 
In particular, Qq is connected. 

The equivalence of disth and distg^ is clearly still true when X is infinite, 
though the graph Qc will not be connected in general. In fact, in many 
natural maximum set systems (for example, open intervals on a densely 
ordered set), Qc is totally disconnected. 

Theorem 3.7. '^{x^ y) is a stable formula with LD{ip) < n if and only if 
for every C £ Tr{if,M^'^^) which is d-maximum for some d £ to, for any 
Ci,C2€C, \C1\C2\ <n. 

Proof. First suppose (^(x, y) is a stable formula with LD(ip) < n. 

Let Ci, C2 € C, where C € Tr{ip, M'^') is d-maximum for some d € We 
show |Ci \ C2I < n. 

Let A <Z Ci\C2 where A = {oi, . . . ,0^}. By Lemma [3T6l after possibly 
reordering A, there are B^, -Bfc„i, . . . , Si in C, on the path from Ci to C2 in 
Qc\A-, such that ai € Bj iff i < j. Thus k < n, and consequently |Ci\C2| < n. 

Conversely, suppose LD((^) > n. Let A = {ao, . . . ,a„} C m'^'I and B = 
{bo,..., b„} CMlyl such that 9K^(/?(ai,bj) i<j. Fut C = S^{B)\a. 

Then by Theorem 12. 3^ C is 1-maximum, and |-B\0| = \B\ = n + 1. 

□ 
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The above theorem shows that much of the nature of Shelah's famous 
characterization of stable formulas by indiscernibles (see [15]) is already 
visible at the level of maximum traces. Unspooling the theorem reveals a 
structural characterization of stable maximum set systems, as we now show. 

If X is a set, and C C ■p(X), define the ladder dimension of C to be the 
maximal n € w such that there are xi, . . . , x„ G X and Bi, . . . , Bn G C with 
Xi G Bj if and only if f < j. Say that C is stable just in case it has finite 
ladder dimension. 

UACX, define CAA = {CAA : C G C}. 

Lemma 3.8. If C T^i^) has LD{C) = n, then for any A C X , we have 
LD{CAA) < 2n, and this bound is tight. 

Proof. For brevity, we show only the tightness of the bound. Fix n ^ u 
X be the integers between —n and n, inclusive, but not including zero 
C = {[0, i] n X : < i < n} U {[-i, 0] n X : < i < n}. Clearly LD(C) 
But LD(CA[-n, -1]) = 2n. 

Note that the example in the above proof is 1-maximum. 

Corollary 3.9. Let C C V{X) be d-maximum of ladder dimension n. 

(1) If^eC, thenC^ [X]^". 

(2) CAB C [X]^2n any B eC. 

(3) C C [X]^2n^^ any B eC. 

Proof. The claim in (1) is clear from Theorem 13. 71 For the claim in (2) note 
that G CAB (because B £ C), and the ladder dimension of CAB is at 
most 2n by Lemma ESI Therefore CAB C [X]^^" ^^^^ qj^^j^^ ^-g-j fgHows 
after applying AB to both sides of the containment in (2). □ 

Clearly every finite maximum set system is stable, and so the above says 
something about the structure of finite maximum classes. 

The 2n bound in Corollary 13.91 part (2) is tight, as the following example 
shows. Let X = {1, ... , 2n}, andC = {D : D £ [X]^"}. Clearly LD(C) = n. 
Now putting B = {1, . . . , n} gives CAB an element of cardinality 2n. 

On the other hand, it seems possible that for some i? C X, it may be 
the case that CAB CI [X]-", where the hypotheses are as in Corollary 
13.91 However, since the hypotheses admit all finite maximum classes, this 
conjecture may be too optimistic. Such as result would clearly be the best 
possible. 

It is also evident from the above that the stable maximum set systems 
are exactly those maximum set systems C C V{X) which can be realized as 
C C [X]'"Ai? for some m £ uj and B C X (because the latter systems are 
clearly stable). 

It would be useful to know whether every C of ladder dimension n embeds 
into a 0(n)-maximum set system C' . See [2] for relevant embedding notions. 
If this were true, it would have interesting applications to definability of 
types and VC-density. 



. Let 
. Let 

= n. 

□ 
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4. Conclusion 

In model theory, much of the combinatorial content comes from consider- 
ing formulas restricted to indiscernible sequences. The existence of sequences 
of indiscernibles is guaranteed by Ramsey's theorem (and compactness), 
though it is rarely required to exhibit a concrete sequence of indiscernibles. 

When dealing with a certain formula (/^(x, y) on a sequence A = (aj)^^/ 
it is a weaker condition to assume that (/? is maximum on A than to assume 
that A is indiscernible. However, as we have seen, if ip is maximum on A, 
that provides "enough" indiscernibility for some combinatorial notions to 
come through. Namely, dp-rank, NIP, and stability can all be understood 
in terms of the maximum property. 

Unlike indiscernible sequences, maximum domains are frequently easily 
constructible. In the semilinear case, it follows from the work of Floyd and 
Dudley that a basic semilinear family will be maximum on a set of points 
which is in "general position," for which it is sufficient to take a randomly 
selected set of points. 

Moreover, the similarity of maximum families and formulas on indis- 
cernible sequences provides a point of contact between work done in com- 
putational learning theory and model theory, where, especially recently, re- 
searchers are pursuing compatible combinatorial goals, but without a com- 
mon framework. 
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